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Abstract  -  To  recognize  an  object  in  an  image,  an 
algorithm  must  identify  not  only  the  object  pixels,  but  also 
non-object  clutter  pixels.  Non-object  pixels  can  be 
assessed  with  a  priori  clutter  models  that  account  for  the 
varying  terrain  and  cultural  objects.  Radar  clutter  models 
have  been  well  developed;  however,  these  models 
typically  incorporate  a  single  distribution  to  capture 
background  effects.  In  this  paper,  we  propose  to  use  a 
fusion  of  distributions  using  an  additive  mixture  model  to 
characterize  various  background  clutter  information  so 
as  to  more  accurately  develop  a  clutter  model  useful  for 
object  recognition.  In  a  radar  example,  we  show  a  fused- 
distribution  using  a  Rayleigh  and  Pareto  model 
describing  the  average  and  heavy  tail  clutter 
characteristics. 
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1  Introduction 

Fusion  models  [1,  2],  such  as  the  User  Fusion  model, 
incorporate  six  levels  of  information  processing.  Level  0 
includes  a  priori  models  of  the  environment  and  targets, 
pre-defined  mission  objectives,  and  data  preprocessing. 
Level  1  is  object  assessment.  Level  2  and  3  are  situation 
assessment  and  threat  assessment.  Level  4  is  process 
refinement  and  level  5  is  User  refinement.  Since  the  User 
defines  the  location  and  objects  of  interest,  the  User  plays 
a  key  role  in  coordinating  target  recognition.  The  fusion 
system  must  account  for  known  and  unknown  clutter 


Figure  1.  User-Fusion  Model 
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responses  to  facilitate  timely  target  recognition.  The 
effective  and  efficient  user-fusion  interaction  includes 
clutter  modeling  of  a  predefined  location. 

Clutter  Modeling  is  well  developed  for  various  radar 
systems  and  environmental  conditions  [3].  While  there  is 
a  large  body  of  literature  on  radar  analysis,  most  of  the 
radar  parameters  are  based  on  empirical  application- 
specific  results.  Since  the  application  drives  the  analysis, 
so  does  the  environment  of  the  defined  location.  Clutter 
modeling  is  based  on  the  uncertainty  of  the  environment 
as  described  probabilistically.  [4]  Thus,  the  goal  of  this 
study  is  to  derive  a  clutter  probability  distribution  function 
(PDF)  to  enhance  radar  modeling. 

Radar  modeling  typically  includes  a  system  signal 
model  which  includes  a  state  and  an  associated  variance. 
[5]  Many  distributions  provide  state  and  variance 
estimates,  and  the  correct  model  is  one  that  captures  a 
majority  of  the  data.  For  instance,  a  Rayleigh  [3,  6,  7], 
Log-normal  [8,  9],  or  Weibull  distribution  [10,  11]  are 
used  to  capture  radar  clutter.  For  various  background 
environments  (i.e.  sea,  land,  and  air)  the  models  have  well 
been  developed.  Other  distributions  that  have  been 
proposed  for  describing  the  statistics  of  clutter  include 
Ricean,  [12]  Rice  squared,  [13]  gannna,  [14],  K- 
distribution  [15,  16],  log-normal-Rayleigh  [17]  and  log- 
Weibull.  [18]  A  single  parameter  model,  in  most  cases,  is 
not  appropriate  and  further  refinement  is  needed.  For 
instance,  at  a  seashore,  the  clutter  model  would  have  to 
model  both  the  sea  and  the  land,  and  if  the  object  of 
interest  transfers  from  the  sea  to  the  land,  then  the 
appropriate  clutter  model  would  have  to  be  determined.  A 
good  example  of  combining  distributions  of  data  is  a 
mixture  model  [17],  which  utilizes  expectation 
maximization.  Metropolis  Hastings,  or  Gibbs  sampling  to 
estimate  parameters. 

A  challenging  clutter  model  is  a  forest.  The  forest  can 
vary  in  density  and  height  based  on  the  amount  of  foliage 
present.  To  adequately  model  the  forest  clutter,  we  would 
like  a  model  that  represents  both  a  thick/dense  forest  as 
well  as  one  that  is  sparse/thin.  To  do  this,  we  propose  a 
fusion  of  distributions  by  additive  mixture  modeling  for 
the  analysis  which  captures  the  heavy-tail  clutter 
characteristics  of  the  data. 

This  paper  proposes  a  methodology  for  in  situ 
estimation  of  mixture  parameters  of  data  using  a  fusion  of 
distributions  and  applies  the  general  strategy  to  clutter 
modeling.  Section  2  details  radar  target  and  clutter 
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modeling.  Section  3  develops  the  general  theory  for  the 
distribution  fusion.  Section  4  shows  a  specific  example  of 
distribution  fusion  and  Section  5  draws  conclusions. 
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Next,  we  describe  standard  target  models. 


(4) 


2  Radar  Clutter  Modeling 

Skolnik’s  text,  Introduction  to  Radar  Systems,  [3]  has 
long  been  regarded  as  a  key  resource  for  various  radar 
modeling.  There  are  various  models  types,  such  as  noise, 
target,  and  clutter  models.  Noise  factors  include:  antenna, 
thermal,  and  transmission  factors.  These  radar  noise 
clutter  factors  contribute  to  the  false  alarms  in  signal 
detection. 

The  radar  equation  is  defined  as: [3] 

Pt  G  Ae  g 

(471)2  S^„ 

where  =  transmitted  power 

G  =  Antenna  Gain 
Ag  =  Antenna  effective  aperture,  m^ 
a  =  Radar  Cross  section  of  the  target,  m^ 

Smin  =  Minimum  detectable  signal,  W 

The  radar  designer  can  choose  all  the  parameters  except 
for  the  radar  cross  section  of  the  target  (since  these  are 
unknown).  Because  of  (1)  noise,  (2)  uncertainties  of  target 
cross  section,  (3)  signal  losses,  and  (4)  surface  and 
atmospheric  effects;  the  range  must  be  described 
probabilistically.  Thus,  the  correct  radar  range  will  be  a 
function  of  the  radar  return  probability  distribution 
function  (PDF)  (or  cumulative  distribution  function)  and 
probability  density  function  (pdf). 

2.1  Probability  Distributions 

Statistical  phenomena  can  be  presented  by  a  PDF,  P(x), 
and  a  pdf  p(x). 

X 

P(x)  =  J  p(x)dx  (2) 


2.2  Target  Models 

Target  models  are  based  on  a,  the  Radar  Cross  section  of 
the  target,  or  scattering  object,  a  represents  the  magnitude 
of  the  echo  return  signal  to  the  radar  by  the  target.  The 
definition  of  the  radar  cross  section  is 


a  = 


power  reflected  toward  source 
solid  angle 

incident  power  density  /  An 


=  4  7rR2 


(5) 


where: 

R  =  range  of  the  target 

Ej-  =  electric  field  strength  return 

Ej  =  electric  field  strength  incident  on  target 


Equation  5  assumes  that  the  radar  wave  is  far  enough 
away  to  be  considered  planar  and  that  the  power  is 
scattered  uniformly  in  all  directions.  Real  targets  do  not 
scatter  the  energy  uniformly  so  we  need  probability 
models  for  the  target. 

When  the  wavelength  is  large  compared  to  the  objects 
dimension,  scattering  follows  a  Rayleigh  distribution. 
When  the  wavelength  is  small  compared  to  the  objects 
dimension,  it  is  in  the  optical  region,  where  the  target  can 
be  detected.  The  different  regions  can  be  modeled  as  to 
the  contribution  of  spheres,  rods,  plates,  cones,  and  corner 
reflectors  based  on  associated  relationships  related  to  the 
frequency  and  wavelength. 

Complex  target  models  are  based  on  the  magnitude  and 
phase  of  the  signal.  The  target  models  of  interest  are  [3] 


(1)  Chi  square  of  degree  2m 


,  ,  m  f  f  m 


a>0(6) 


The  benefit  of  using  a  probability  model  is  that  many 
systems  are  effectively  described  in  terms  of  only  two 
parameters  -  a  mean  and  a  variance.  For  bandwidth 
considerations,  being  able  to  model  the  target  or  clutter 
with  only  two  parameters  is  highly  desirable. 

To  model  the  noise,  we  use  a  Gaussian  model,  where 
the  mean  (p)  noise  is  0,  and  m2  is  the  second  moment 

The  received  signal  for  radar  system  with  an  input  noise 
that  is  Gaussian  can  be  modeled  as  a  Rayleigh 
distribution. 


(2)  Rice 

P(cf)=T“expr-s  -“(1  +s^))io[2a  /;^s(l+s)  ) 
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where  s  is  the  ratio  of  the  radar  cross  section  of  the 
dominant  scatterers  to  the  cross  section  of  the  small 
scatterers  and  Iq  is  the  modified  Bessel  function. 


and  (3)  Log  Normal 
1 


p(a): 


KG  Sd 


exp 


(8) 


where  is  the  standard  deviation  of  Ln  (a  /  and 
is  the  median  of  a.  The  log  normal  is  used  for  high 
target/clutter  magnitudes. 

2.3  Clutter  Models 

Radar  clutter  is  unwanted  echoes  from  the  natural 
environment,  which  “clutter”  the  radar  and  challenge 
target  detection.  Clutter  includes  echoes  from  land,  sea, 
weather,  and  animals.  Clutter  is  spatially  distributed  and 
larger  in  physical  size  than  the  radar  resolution  cell.  Man¬ 
made  objects  (i.e.  buildings),  are  “point,”  or  discrete, 
clutter  echoes,  that  produce  large  backscatter.  Large 
clutter  echoes  can  mask  echoes  from  desired  targets. 
When  clutter  is  greater  than  receiver  noise,  clutter  signal 
processing  dominates. 

Echoes  from  land  or  sea  are  examples  of  surface  clutter. 
Echoes  from  rain  and  chaff  are  examples  of  volume 
clutter.  The  echo  magnitude  from  distributed  surface  clut¬ 
ter  is  proportional  to  the  area  illuminated.  To 
independently  model  the  clutter  echo  of  the  illuminated 
area,  the  clutter  cross  section  per  unit  area,  a®,  is  utilized: 


where  is  the  radar  cross  section  of  the  clutter  occupying 
an  area  A^.  is  a  dimensionless  quantity,  expressed  in 
decibels  with  a  reference  value  of  one,  and  is  also  known 
as  the  scattering  coefficient,  differential  scattering  cross 
section,  normalized  radar  reflectivity,  backscattering 
coefficient,  and  normalized  radar  cross  section  (NRCS). 

2.3.1  Surface  Clutter  Radar  Equation 

In  designing  surface  clutter  models,  we  develop 
different  models  based  on  the  grazing  angle  [19]  which 
can  be  effective  to  model  land  and  sea  clutter. 


Figure  2  -  Elevation  geometry  shows  the  extent  of  the 
surface  illuminated  by  the  radar  pulse,  (top)  plan  view 
showing  the  illuminated  clutter  resolution  cell  consisting 
of  individual,  independent  scatterers.[3] 

Low  Grazing  Angle.  Figure  2  depicts  a  radar 
illuminating  the  surface  at  a  small  grazing  angle,  y.  For 
low  grazing  angles  the  range  is  determined  by  the  radar 


pulse  width  t.  The  cell  width  in  the  cross-range  dimension 
is  determined  by  the  azimuth  beamwidth  0b  and  the  range 
R.  From  the  simple  radar  equation,  the  received  echo 
power  Pj.  is 


Pt  G  Ag  g 
(4nf  R* 


(10) 


where  P^  =  transmitter  power,  W 
G  =  antenna  gain 

Ag  =  antenna  effective  aperture,  m^ 

R  =  range,  m 

g  =  radar  cross  section  of  the  scatterer,  m^ 


To  model  target  echos  (rather  than  clutter),  we  let  P  j.  =  S 
(received  target  signal  power)  and  g  =  g^,  (target  cross 
section).  The  signal  power  returned  from  a  target  is  then 


S  = 


Pt  G  Ae  gt 
i4nf 


(11) 


When  the  echo  is  from  clutter,  the  cross  section  g 
becomes  g^  =  g^  A^,  where  the  area  A^  of  the  radar 
resolution  cell  is 


Ac  =  R  0B  (c  T  /  2)  sec  y  (12) 

with  0B  =  two-way  azimuth  beamwidth,  c  =  velocity  of 
propagation,  t  =  pulsewidth,  and  y  =  grazing  angle 
(defined  with  respect  to  the  surface  tangent).  The  area  Ac 
in  range  resolution,  is  (c  t  /  2),  where  the  factor  of  2  in  the 
denominator  accounts  for  the  two-way  propagation  of 
radar.  With  these  definitions,  the  radar  equation  for  the 
surface-clutter  echo-signal  power  C  is 

Pt  G  Ac  [0B  (c  T  /  2)  sec  y] 

(471)^ 

When  the  echo  from  surface  clutter  is  large  compared  to 
receiver  noise,  the  signal-to-clutter  ratio  is 


^  ^ _ 

C  “  g®  R  0B  (c  T  /  2)  sec  y 


(14) 


If  the  maximum  range,  Rmax^  corresponds  to  the  minimum 
discernible  signal-to-clutter  ratio  then  the  radar 

equation  for  the  detection  of  a  target  in  surface  clutter  at 
low  grazing  angle  is 

_  _ ^t _ 

Rmax  =  (S/C)nu„  a°  Gg  (c  T  /  2)  SCC  \|/ 

Notice,  the  range  appears  as  the  first  power  rather  than 
the  fourth  power  as  in  the  usual  noise-dominated  radar 
equation,  which  results  in  greater  variation  of  the 
maximum  range  of  a  clutter-dominated  radar.  When  there 


is  uncertainty  or  variability  in  the  range,  statistical 
representations  are  useful.  To  model  a  large  grazing  angle, 
we  need  to  address  different  radar  scattering. 

Variation  of  Surface  Clutter  with  Grazing  Angle. 
Figure  3  shows  the  general  form  of  surface  clutter  as  a 
function  of  grazing  angle.  There  are  three  different 
scattering  regions.  At  high  grazing  angles,  the  radar  echo 
is  due  mainly  to  reflections  from  clutter  that  can  be 
represented  as  individual  planar  facets  oriented  so  that  the 
incident  energy  is  directed  back  to  the  radar.  The 
backscatter  can  be  quite  large  at  high  grazing  angles.  At 
the  intermediate  grazing  angles,  scattering  is  somewhat 
similar  to  that  from  a  rough  surface.  At  low  grazing 
angles,  back  scattering  is  influenced  by  shadowing  (mask¬ 
ing)  and  by  multipath  propagation.  Shadowing  of  the 
trough  regions  by  the  crests  of  waves  prevents  low-lying 
scatterers  from  being  illuminated.  [3] 


Figure  3  General  nature  of  the  variation  of  surface  clutter 
as  a  function  of  grazing  angle,  showing  the  three  major 
scattering  regions.  [3] 


High  Grazing  Angle  The  surface  clutter  area  viewed  by 
the  radar  is  determined  by  the  antenna  beamwidths  0b  and 
(1)b,  in  the  two  principal  planes.  The  clutter  illuminated 
area  A^  in  is  (jilA)  R  0b  R  (|)b  /  2  sin  y,  where  y  =  grazing 
angle  and  R  =  range.  The  factor  ti  /  4  accounts  for  the 
elliptical  shape  of  the  illuminated  area,  and  the  factor  of  2 
in  the  denominator  is  necessary  since  in  this  case  0b  and 
(|)b  are  the  one-way  beamwidths.  Substituting  a  =  A^, 

letting  Pj-  =  C  (the  clutter  echo  power),  and  taking  G  = 

0B  the  clutter  radar  equation  in  this  case  is 

71  Pt  Ae 

^  128  R^  sin  y 

The  clutter  power  is  seen  to  vary  inversely  as  the  square  of 
the  range.  This  equation  applies  to  the  echo  power 
received  from  the  ground  by  a  radar  altimeter  or  the 
remote  sensing  radar  known  as  a  scatterometer. 

Mean  and  Median  Values  The  characteristics  of  clutter 
echoes  are  usually  given  in  statistical  terms.  It  is  often 
described  either  by  the  mean  (average)  value  of  a®  or  the 
median  (the  value  exceeded  50  percent  of  the  time).  For  a 


Rayleigh  pdf,  the  difference  between  the  mean  and  the 
median  is  small  (a  few  percent);  but  for  non-Rayleigh 
distributions,  the  mean  and  the  median  can  be  quite 
different. 

For  proper  radar  design,  the  pdf  of  the  clutter  echoes 
should  be  known  so  that  the  receiver  detector  can  be 
designed  appropriately.  Unfortunately,  when  the  clutter 
statistics  cannot  be  described  by  the  classical  Rayleigh 
pdf,  it  is  often  difficult  to  define  a  specific  quantitative 
statistical  description.  For  this  reason,  many  radar  designs 
have  been  based  on  the  mean  value  of  the  clutter  rather 
than  some  statistical  model.  This  single  parameter  model 
is  inadequate  in  many  cases  (i.e.  forest). 

2.3.2  Surface  clutter  models 

Because  clutter  variability  is  often  described  by  a  pdf 
or  PDF,  this  section  describes  several  statistical  models 
that  have  been  suggested  for  characterizing  the 
fluctuations  of  the  surface-clutter  cross  section  per  unit 
area,  or  a^. 

Rayleigh  Distribution  [6]  This  model  is  based  on  the 
assumption  that  there  are  a  large  number  (e.g.  10)  of 
randomly  located  independent  uniform  scatterers  within 
the  clutter  surface  area  illuminated  by  the  radar. 

If  the  radar  receiver  uses  a  linear  detector,  the 
probability  density  function  of  the  voltage  envelope  of  the 
Rayleigh  distributed  clutter  at  the  receiver  output  is 

v>0  (17) 

where  m2  is  the  mean-square  value  (second  moment)  of  the 
envelope  v.  The  mean  mj  (first  moment)  is  (m2  ti  /  4) 
and  the  median  is  (m2  Ln  2)^^^  so  that  the  mean-to-median 
ratio  is  [7r(4  Ln  2)]  =  1.06,  or  0.27  dB.  The  mean  (p)  of 

the  Rayleigh  distribution  is  proportional  to  the  standard 
deviation,  or 

[4 

StDev  =  A  /  ~  -  1  X  p  =  0.523  x  p 

The  Rayleigh  pdf  of  v  normalized  to  its  median  Vni  rather 
than  the  mean-square  value,  is 

p(vn)  =  2  (Ln  2)  exp  t 2)  v^„]  >  q 

where  Vn  =  v  /  v^. 

The  Rayleigh  pdf  also  describes  the  envelope  of  the 
output  of  the  receiver  when  the  input  is  Gaussian  noise. 
The  Gaussian  noise  assumption  applies  to  the  detection  of 
signals  in  clutter  when  the  clutter  statistics  are  Rayleigh 
and  the  clutter  echoes  are  independent  pulse  to  pulse  (as  is 
receiver  noise).  However,  this  is  not  often  true  with 
clutter.  The  noise  voltage  from  a  receiver  is  independent 
from  pulse  to  pulse  since  it  is  decorrelated  in  a  time  1  /  B, 
where  B  =  bandwidth.  The  decorrelation  time  of  clutter 


can  be  much  longer.  This  is  important  when  attempting  to 
integrate  pulses  for  improving  detection  since  clutter 
echoes  are  generally  not  decorrelated  pulse  to  pulse.  To 
apply  Rayleigh  statistics,  the  clutter  might  be  decorrelated 
by  changing  the  frequency  pulse  to  pulse  or  by  waiting  a 
sufficiently  long  time  between  pulses  to  allow  the  tempo¬ 
ral  decorrelation  of  the  clutter  to  occur.  [3] 


measurements  that  lie  between  the  Rayleigh  and  the  log¬ 
normal.  The  Rayleigh  is  actually  a  special  case  of  the 
Weibull;  and  with  appropriate  selection  of  the 
distribution’s  parameters,  the  Weibull  can  be  made  to 
approach  the  log-normal. 

If  V  is  the  voltage  amplitude  of  a  linear  detector,  the 
Weibull  pdf  for  the  normalized  amplitude  v^  =  v  /  Vj^  is 


Log-Normal  Distribution  The  Rayleigh  clutter  model 
applies  when  the  radar  resolution  cell  is  large,  containing 
many  scatterers,  with  no  one  scatterer  dominant.  It  has 
been  used  to  characterize  relatively  uniform  clutter. 
However,  it  is  not  a  good  representation  of  clutter  when 
the  resolution  cell  size  and  the  grazing  angle  are  small. 
Under  these  conditions,  there  is  a  higher  probability  of 
getting  large  values  of  clutter  (higher  “tails”)  than  is  given 
by  the  Rayleigh  model. 

One  of  the  first  models  suggested  to  describe  non- 
Rayleigh  clutter  was  the  log-normal  pdf  since  it  has  a  long 
tail  (when  compared  to  the  Rayleigh).  In  the  log-normal 
pdf  the  clutter  echo  power  expressed  in  dB  is  Gaussian. 
The  log-normal  pdf  for  the  echo  power  when  the  receiver 
has  a  square-law  detector  is  [6] 
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where  5-  =  standard  deviation  of  In  P,  and  Pm  =  median 
value  of  P.  The  ratio  of  the  mean  to  the  median  is  exp 
(sV2).  With  a  linear  receiver,  the  pdf  for  the  normalized 
output  voltage  amplitude  Vn  =  v  /  v^  with  v^  =  median 
value  of  V,  is 


2 

P(vn)  =  exp 

S  Vn 

where  remains  the  standard  deviation  of  In  P. 

The  log-normal  distribution  is  specified  by  two 
parameters  (the  standard  deviation  and  the  median), 
whereas  the  Rayleigh  is  specified  by  only  one  parameter 
(the  mean  square  value).  Log-normal  clutter  is  often 
characterized  by  the  ratio  of  its  mean  to  median.  Based  on 
measurements  by  different  experimenters,  the  mean-to- 
median  ratio  for  ground  clutter  measurement  at  low 
grazing  angles,  it  had  a  value  of  about  2.6  dB. 

It  should  be  expected  that  because  of  its  two  parameters, 
the  log-normal  pdf  can  be  made  to  fit  experimental  data 
better  than  can  the  one-parameter  Rayleigh.  However,  it  is 
sometimes  said  that  the  log-normal  tends  to  predict  higher 
tails  for  the  pdf  than  normally  experienced  with  most  non- 
Rayleigh  clutter,  just  as  the  Rayleigh  model  tends  to 
predict  lower  values. 

Weibull  Distribution  The  Weibull  distribution  has  been 
applied  to  land  clutter.  [10-11]  The  Weibull  distribution  is 
a  two-parameter  family  that  can  be  made  to  fit  clutter 


V(LnvJ^ 


Vn  >0  (20) 


p(vn)  =  a  (Ln  2)  v“ '  ‘  exp  [  '  2)  v“  ]  >  q  ^21) 

where  a  is  a  parameter  that  relates  to  the  skewness  of  the 
distribution  (sometimes  called  the  Weibull  skewness 
parameter),  and  Vj^  is  the  median  value  of  the  distribution. 
When  a  =  2,  the  Weibull  takes  the  form  of  the  Rayleigh; 
and  when  a  =  1,  it  is  the  exponential  pdf.  The  mean-to- 
median  ratio  is  (Ln  2)  “  “  r(  1-H  1/a),  where  r(*)  is  the 

gannna  function.  With  a  square-law  detector,  the  Weibull 
pdf  for  the  power  P  =  is 

p(P„)  =  P  (Ln  2)  ‘  exp  [  '  2)  Pn  ]  >  q  (22) 

2 

where  P  =  a  /2,  =  P  /  Pm,  and  P^  =  v^  is  the  median 

value  of  P. 

Application  of  the  Statistical  Descriptions  of  Clutter 

Caution  should  be  noted  when  using  non-Rayleigh 
distributions  since  they  are  not  based  on  physical 
properties.  They  are  curve-fitting  models  used  to  describe 
experimental  data.  Analyses  in  the  literature  of  the 
detection  of  targets  in  non-Rayleigh  clutter  can  be  found 
that  employ  Weibull  clutter,  [11]  and  log-normal  clutter, 
[9]. 


detector,  normalized  by  the  median  value  v^-  for 
Rayleigh,  log-normal  [with  a  mean-to-median  ratio  of  5 
dB],  and  Weibull  [with  a  skewness  parameter  of  0.5]  [3] 

A  comparison  of  the  Rayleigh  pdf  with  examples  of  log¬ 
normal  and  Weibull  is  shown  in  Figure  4  (The 
equivalence  of  the  Rayleigh  and  the  Weibull  with  a  =  2  is 
noted  in  the  figure.)  The  ability  of  two  parameter 


distribution  like  the  Weibull  to  fit  experimental  data  is 
illustrated  by  the  quite  different  shapes  for  the  a  =  0.5  and 
a=  2  curves. 

PDFs  that  contain  two  parameters,  such  as  the  Weibull, 
can  be  made  to  fit  experimental  data  better  than  can  a 
single-parameter  distribution.  However,  curve  fitting  of  a 
particular  distribution  is  not  always  of  sufficient  help  to 
the  radar  engineer  trying  to  determine  how  to  design  a 
signal  processor  to  detect  signals  in  non-Rayleigh  clutter 
since  a  single  statistical  expression  does  not  do  well  in 
fitting  the  wide  variety  of  real-world  clutter. 


3  Problem  Description 

In  this  paper,  we  are  interested  in  modeling  the  clutter 
observed  from  an  environment  with  various  objects  and 
clutter.  The  clutter  comes  from  the  different  natural 
features  in  the  environment.  Since  we  are  interested  in 
the  desired  targets,  we  model  the  rest  of  the  image  by 
establishing  a  clutter  model. 

3.1  Problem  Statement 

The  problem  we  are  looking  at  is  modeling  forest 
clutter.  There  is  an  associated  scene  from  which  we  are 
trying  to  detect  the  object  and  model  the  various  clutter 
regions  (i.e.  roads,  forest,  grass),  as  shown  in  Figure  5. 


Figure  5.  Scene  of  Interest 


For  targeting  success,  and  because  the  ATR  tasks  have 
narrow  time  windows,  we  need  to  develop  a  clutter  model 
for  rapid  ATR.  We  seek  to  model  the  clutter  where  the 
radar  power  is  proportional  to  the  background  clutter.  By 
assessing  the  radar  image,  we  see  that  there  are  various 
land  clutter  features  that  contribute  to  the  signal  return.  To 
detect  a  target,  we  need  to  effectively  determine  the 
background  clutter  to  increase  the  target- to-clutter  ratio. 
Since  a  defined  distribution  for  clutter  modeling  is  not 
effective,  we  seek  a  fused  clutter  model  based  on  a 
distribution  of  the  clutter  histogram. 


3.2  Modeling 

To  model  the  heavy  tails  of  the  spurious  clutter,  we 
experienced  difficulty  with  the  Rayleigh,  Log-Normal  and 
the  Weibull  for  the  forest  scene.  We  propose  using  a 
Pareto  distribution.  The  distribution  of  a  random  variable 
X  is  said  to  be  heavy  tailed  if  [20] 

1  -  F(x)  =  Pr[A  >  x]  ~  “T  as  X  ^  00,  0  <  a  (23) 

X 

In  general,  a  random  variable  with  a  heavy-tailed 
distribution  exhibits  a  high  or  even  infinite  variance  and 
possibly  an  infinite  mean.  A  random  variable  with  a 
heavy-tailed  distribution  will  include  large  values  with 
non-negligible  probability.  Typically,  if  such  a  random 
variable  is  sampled,  the  result  will  include  many  relatively 
small  values  but  a  few  samples  that  are  relatively  large. 
The  Pareto  distribution  has  successfully  modeled  a  wide 
variety  of  phenomena  from  the  social  and  physical 
sciences  to  communication  theory.  The  Pareto 
distribution  parameters  k  and  a  (k,  a  >  0),  can  be 
represented  with  density  and  distribution  functions 
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and  a  mean  value 

E[X]  =  k 

a  -  1 

(a>l) 

(27) 

The  parameter  k  specifies  the  minimum  value  of  the 
random  variable  and  a  determines  the  mean  and  variance 
of  the  random  variable:  If  a  <  2,  then  the  distribution  has 
infinite  variance,  and  if  a  <  I,  it  has  infinite  mean  and 
variance.  Figure  6  compares  the  Pareto  and  exponential 
density  functions  on  a  log-linear  scale. 


Figure  6.  Pareto  and  exponential  PDFs. 


Note  that  on  this  scale  the  exponential  density  function 
is  a  straight  line,  reflecting  the  exponential  decay  of  the 
distribution.  The  tail  of  the  Pareto  distribution  decays 
much  more  slowly  than  exponential;  hence  the  term  heavy 
tail 

3.3  Fusion  of  Distributions 

The  fused  distribution  approach  requires  some  thought 
associated  with  model  assumptions.  In  most  distributions, 
there  are  associated  moments  and  thus  the  combined 
moment  is  different  than  the  individual  moments. 

We  combine  the  (1)  Rayleigh  and  the  Pareto 
distributions  in  this  study  for  the  clutter  and  the  (1) 
Rayleigh-Log-Normal,  and  (3)  Rayleigh- Weibull  are  not 
shown.  Assuming  we  have  a  distribution  f  i(x)  where  i  = 
1,  . . .,  n  distributions,  we  can  weight  the  distributions 

fpW  =aifi(x)  -H...-HanfnW  (28) 

based  on  the  contribution  of  the  distribution  to  the  reduce 
the  error  between  the  fused  distribution  fpC-^)  and  the 
normalized  histogram  clutter.  The  weights  are  normalized 
values,  where 

ai  =  ai  /  X  a;  (29) 

i=  1 

Another  method  to  combine  distributions,  is  to  determine 
the  effective  range  over  which  one  model  is  more  accurate 
than  another.  Without  extensive  evaluation  of  all  the  land 
clutter,  this  would  prove  ineffective.  Thus,  we  fuse  the 
distributions  to  model  the  average  clutter  region. 

The  statistics,  such  as  the  mean  and  variance,  of  the 
fused  model  are  important  for  clutter  modeling  and  target 
detection.  For  the  mean  and  variance,  we  weight  (in  the 
additive  fusion  case)  them  below: 


The  histogram  analysis  shown  in  Figure  7  allows  us  to 
model  the  clutter  in  the  image  and  develop  the  underlying 
PDF. 


■  H'Mogr-mfvoi'j  a™ 

AllivhLx-iE  M 

P4rVU  iXM  to’ 

UT-^  to-  dfli. 

■  ■* 

tUM  0Mdri:4 

1 .  Di:;i  irni 

v  ■  ;.i 

■  r  - 

1  ' 

elI-^.-i  ■  1 

_  1 

^  ■  -k 

1  _'.r- 

D  1  I  ^  s  z;  Z  3  ^ 

Figure  7.  Scene  of  Interest 
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The  resulting  statistical  parameters  can  be  used  to  model 
the  clutter  for  land  surfaces. 

4  Results 

For  the  results,  we  present  the  standard  clutter  models 
(i.e.  Rayleigh)  followed  by  the  fused  distribution  model. 

For  the  analysis,  we  use  both  the  histogram  analysis  and  a 
quantile-quantile  (Q-Q)  plot.  The  Q-Q  plot  can  provide 
more  insight  into  the  nature  of  the  statistical  difference 
than  analytical  methods  such  as  the  chi-square  and 
Kolmogorov-Smirnov  2-sample  tests. 

Figure  8.  Heavy  tail  deviation 


The  quantile-quantile  (Q-Q)  plot  is  a  graphical 
technique  to  determine  if  two  data  sets  (of  equal  or  non¬ 
equal  size)  come  from  populations  with  a  connnon 
distribution.  A  Q-Q  plot  is  a  plot  of  the  quantiles  of  the 
first  data  set  against  the  quantiles  of  the  second  data  set.  A 
45 -degree  reference  line  is  also  plotted.  If  the  two  sets 
come  from  a  population  with  the  same  distribution,  the 
points  should  fall  approximately  along  this  reference  line. 
The  greater  the  departure  from  this  reference  line,  the 
greater  the  evidence  for  the  conclusion  that  the  two  data 
sets  have  come  from  populations  with  different 
distributions.  To  determine  if  the  reference  image  is  the 
same  as  the  measured  image,  we  show  the  Q-Q  plot. 
Figure  8  shows  that  the  Rayleigh  model  suffers  from 
inadequately  modeling  the  tail  of  the  distribution. 

OOpto* 

4- 

Estin 


Using  the  fused  distribution  approach,  we  determine 
that  we  could  model  the  tail  and  the  general  clutter. 
Determining  the  threshold  for  the  breakpoint  of  the  clutter 
region  and  the  corresponding  tail  would  be  dependent  on 
the  scene.  Thus  we  chose  to  fuse  the  two  distributions  to 
obtain  a  combined  two-parameter  statistical  model. 
Figure  9  and  Figure  10  show  the  desired  fused 
distribution  for  the  image  in  question. 


Figure  9.  Scene  of  Interest 

Figure  10  shows  the  results  of  the  distributions,  without 
the  clutter  histogram.  The  fused  distribution  in  Figure  10 
shows  the  benefits  of  fusing  the  data  as  opposed  to  just 
determining  a  threshold  for  using  independent  models. 
The  resulting  error  between  the  PDFs  is  smaller,  and  we 
can  determine  the  mean  and  variance  of  the  clutter  based 
on  the  image  content.  Furthermore,  we  addressed  the 
corresponding  features  in  the  image  to  weight  the  density 
of  clutter  objects  to  the  model. 
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Figure  10.  Fused  Distributions 


5  Conclusions 

In  this  paper,  we  presented  a  method  for  fusing 
different  distributions  and  applied  the  methodology  for 


characterizing  the  clutter  associated  with  radar  analysis. 
The  method  of  fused  distributions  by  additive  mixture 
modeling  can  be  applied  to  different  systems  so  as  to 
effectively  determine  a  statistical  nature  of  the  system. 
Using  the  example  presented,  future  research  includes 
more  extensive  comparisons  with  the  expectation- 
maximization  algorithm  in  capturing  the  heavy-tail  data 
and  showing  the  distribution  modeling  gain  for  target 
detection. 
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